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Ab STRACT. We study distance properties of a general class of random directed acyclic graphs (dags). 
In a DAG, many natural notions of distance are possible, for there exists multiple paths between pairs 
of nodes. The distance of interest for circuits is the maximum length of a path between two nodes. 
We give laws of large numbers for the typical depth (distance to the root) and the minimum depth in a 
random DAG. This completes the study of natural distances in random DAGs initiated (in the uniform 
case) by Devroye and Janson (2009+) . We also obtain large deviation bounds for the minimum of a 
branching random walk with constant branching, which can be seen as a simplified version of our main 
00 result. 

<N 

I. Introduction 

Motivated by the circuit value problem, and the delay to evaluate the output, Diaz, Serna, Spi- 

Ph rakis, Toran, and Tsukiji [21] initiated the study of depths in random circuits. The model is that of 

uniform random circuits [19]: a random circuit is built by iterative addition of gates, each gate ran- 
domly choosing k inputs among the outputs of the gates already present. The model has been further 
studied by Arya, Golin, and Mehlhorn [4] and Tsukiji and Xhafa [36]. Writing D x for the depth of 

, , the gate arrived at step x, Tsukiji and Xhafa [36] proved that the depth a random circuit of n gates, 

mdix{D x : x < n} is asymptotic to ke log n in probability (log denotes the natural logarithm). 

Actually, in a random circuit, many distinct directed paths may link two gates, and one can 
define different notions of distances. Devroye and Janson [18] started the systematic analysis of 
these distances. Among those, the distances defined by paths that are only allowed to look one step 

ly^ ahead ("greedy" distances) were studied by Mahmoud [29] and Devroye, Fawzi, and Fraiman [20]. 

lf~) These greedy distances have permitted to quantify the effect of the "power of choice" for depths in 

random trees: like a gate, each node has k potential contacts, but only attaches to the most desirable 
according to a measure of optimality. The models in which the choice of each node is made only 

t-H according the labels of its potential contacts has been studied by Mahmoud [29] and Devroye, Fawzi, 

and Fraiman [20]. D'Souza, Krapivsky, and Moore [23] studied more general rules of growing trees, 
^ where the choice of each node might depend on the degree or the distance to the root of the potential 

J^j ancestors. 

Unfortunately, the distance that has the most meaning in terms of performance of circuits cannot 
be defined in a greedy way: the number of layers of a circuit depends on the maximum length of a 
path between an output and an input. In this paper, we study precisely this distance, and hence the 
number of layers required to evaluate the random circuit. Aside from the depth of the entire circuit 
studied in Arya et al. [4] and Tsukiji and Xhafa [36], the typical depth D n and the minimum depth 
of a gate are also of interest. For the latter quantity, since minl^ :0<x<n} = 0, we study 
minj^D^ : x > n/2} to estimate the concentration of the depths in the circuits. Our main results are 
laws of large numbers for D n (Theorem 4.1) and minl^ : x > n/2} (Theorem 5.1). In particular, 
for the model of uniform random circuits of [36] and k = 2, we show that 

D n mm n / 2 < x < n D x A 

A and '-^^ > A min = - (1) 



logn logn 2 

in probability, where A = 4.31107. . . is the only solution to the equation Alog(2e/A) = 1 that 
is greater than one. Doing so, we prove a conjecture of Devroye and Janson [18] about the value 
of A, and we also identify A m i n for which they did not have a guess. This completes the study of 
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some natural distances in uniform random circuits started in [36] and [18]. In fact, our results apply 
to a more general class of random DAGs where the parent nodes of x are not necessarily chosen 
uniformly from {0, . . . , x — 1} [20] (we will be more precise shortly). In general, the limit constants 
are characterized uniquely as the root of some (often implicit) equation that depends on the precise 
model of attachment. 

The problem of distances in random DAGs is related to minima in branching random walks. The 
relation between distances in random trees such as random recursive trees, or binary search trees and 
minima in branching random walks has been exhibited by Pittel [32] and Devroye [16]. Distances 
in random circuits can also be studied using the simpler setting of branching random walk, but the 
relation is much more intricate because the circuits do not have a real tree structure. Although we do 
not use the results about branching random walks directly, we think that the reader may find it useful 
to warm up with this simpler model. Moreover, the ideas leading to the tail bounds for minima 
in branching random walks presented in Theorems 3.2 also underly the main argument behind our 
analysis of the behaviour of min n / 2 <^< n D x . 

Further bibliographic remarks. For a slightly different random circuit model, Mahmoud 
and Tsukiji have investigated the asymptotic behaviour of the number of outputs, that is gates that 
do not feed in any other gate [30, 35]. The profile of the related model of /c-trees has been studied 
by Darrasse, Hwang, Bodini, and Soria [14]. Depths in random circuits are also used by Codenotti 
et al. [13] in relation to parallel computations time. 

Outline of the paper. The model of random dag is formally introduced in Section 2. In 
Section 3, we study tail bounds for minima of branching random walks. The results presented there 
help understand why the values for the limiting constants in the law of large numbers in (1) are 
what they are. To the best of our knowledge, the exponential rates in the tail estimates we derive 
for the branching random walk were not known before. Section 4 is then devoted to the study of 
the typical number of edges D n on the longest path between n and the root of a random DAG. 
Finally, the minimal distance between a node (with sufficiently large label) and the root is analyzed 
in Section 5. Although we do not think that the results for branching random walks are obvious, 
the main difficulty consists, in the DAG model, in dealing with the intricate dependence between the 
different paths up the root that originate not only from a single node, but also from different nodes. 

2. Definitions and notation 

We consider the more general model of scaled attachment random recursive DAGs (&-SARRD) 
introduced in [20]. We are given a random variable X, with support in [0, 1). In a /c-SARRD with at- 
tachment X (or (X, /c)-SARRD), every node x chooses k parents: |_#Xe,iJ , |_ x ^,2j , • • • , |_ x ^,fcJ 
where Xo,i, . . . , Xo 5 &, . . . , X Ui i, . . . , X n ^ are independent copies of X. In other words, the ran- 
dom variable X XiP determines the p-th parent of node x. A random DAG of size n is then composed 
of the root and the nodes 1, . . . , n and the edge set binding each node to its k parents. When X is 
uniform, one obtains the uniform recursive circuit (fc-URRD) that is the subject of [4, 18, 36]. 1 

In the following, we always reserve k for the number of parents in the DAG. We let D n represent 
the length of the longest path from node n to node 0. 

We now introduce some notation to describe the DAG. The set of finite words on the alphabet 
A= [k] = {1, . . . , k} is denoted by 

U := (J A m . 

m>0 

The set U is naturally endowed with a partial order: we write v ■< u if v is a prefix of u. We will 
also think of U as a /c-ary tree, where v ^ u if v is an ancestor of u. 

For a node x and a string s G U, L(x, s) is the label of the ancestor of x obtained by following the 
path labeled by s. For example, L(x,l) is the first parent of node x. Concatenation of strings (that 
correspond to path) is denoted by • . Note that in our model we have L(x, s-p) = [L(x, s)X L ( x ^ s ^ p \ . 



One can also consider close variants of the model where the parents are chosen without replacement and the DAG has k 
roots [19]. 
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For a string s and i > 0, let s { be the string composed of the first i letters of s. In the special case 
where i = |s| — 1, we write s~ := S| s |_ 1 for the string where the last letter is dropped. The last 
letter of the word s is denoted by s so that s = s~ • s. 




Figure 1. A binary dag on {0, 1, . . . , 6} and the corresponding exploration tree from node 6. 



Of course, the paths up the root corresponding two strings s and s f might intersect, so that even 
when s and s' have no non-trivial common prefix, the random variables L(n, s) and L(n, s') are 
not independent in general (see Figure 1). However, the dependence of random variables L(n, s), 
s G U is "essentially caused" by common prefixes of the strings s. We will later justify precisely 
this informal fact as we prove our main results in Sections 4 and 5. Understanding the lengths of 
the paths originating from a node n hence somewhat reduces to the analysis of the evolutions of the 
labels L(n, s) in the /c-ary tree U. Furthermore, the reader should be intuitively convinced that the 
labels should satisfy the approximation 

L(n,s) w n JJ X u , 

where X u , u £U are i.i.d. copies of X, which makes the connection between paths in X-DAGs and 
a branching random walk with step distribution Y = — log X. Since we are interested in maximum 
lengths of paths, we should naturally study paths along which the L(x,s) stays large; along these 
paths, the branching random walk should be small. This leads us to the study of asymptotics for 
minima in a branching random walk. 

3. Large deviations for extremes of a branching random walk 

In this section, we consider branching random walks with constant branching factor k and step 
distribution Y. Let Y u , u G U \ be independent and identically distributed random variables 
distributed as Y. It is convenient to assume that Y = 0. Then, define the position of a word u eU 
by S u = Y v - We are interested in the minimum label over the k m nodes at distance m from 

the root, M m ~= mm{S u : u G A m }. 

Asymptotics for M m depend on rate functions associated with the step distribution Y. Let A be 
the cumulant generating function for the step distribution Y and define its convex dual A* [15, 34] 
by 

A(A) = logE[e Ay ] and A*(z) = sup{Xz - A(A)} 

where log denotes the natural logarithm. The following classical theorem describes the first order 
asymptotics of M m . In the entire document, we will always have Y > 0, so that the moment 
condition A (A) < oo for some A < will always be satisfied. 
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Theorem 3.1 ([7, 26, 28]). Suppose Y is such that A(A) < oo for some A < 0. Let 7 = mi{z < 
E [Y] : A*(z) < log k}. Then, 7 is finite and with probability 1, 

r M m 

lim = 7. 

ra-)-oo m 

Moreover, if~E [Y] < 00, by the dominated convergence theorem, we have 

v E[M m ] 

lim = 7. 

m— ^00 rri 

Results on the minima in branching random walks have many applications in the study of ran- 
dom trees. See Devroye [17] for a survey. Here, we are interested in tail bounds for the distribution 
of M m /m. McDiarmid [31] and Addario-Berry and Reed [1] have proved general exponential tail 
bounds for the deviations of M m /m\ however, the exponential rates in the bounds there are not opti- 
mal, and our setting requires to identify them. In the following, we assume that Y is asymptotically 
exponential in the following sense: there exists a constant a G (0, 00) such that 

io g F{r>x} 

> -a, (2) 

x 

as x — » 00. 

The two sides of the distribution of M m /m have in general very different behaviour. Quite 
intuitively, if one wants to make the minimum value at level m smaller, it suffices to modify the 
random variables Y u on a single path of length m, so one expects that the tail should have exponential 
tails with a scale of m on the left. On the other hand, to make M m larger, one needs to modify all 
k™ paths of length m, and it is not immediately clear how one should do this in order to optimize 
the probability. We will show that when the random variable Y has exponential tails in the sense 
of (2), it is essentially best to modify the random variables on the first level of the tree by a huge 
amount. We now turn to formalizing this intuition. 

Theorem 3.2 (Right tail). Suppose that Y > 0, logP{F > x} = —ax + o(x), as x —> 00, for 

a G (0, 00). Let e,5 > 0. Then there exists constants c,c' G (0, 00) such that, for all m large 
enough, 

c • e - ka ^ 1Jr5)m < P {M m > (7 + e)m} < c' ■ e -^e(i-S)m^ 

Proof We start with the lower bound. Consider the k nodes connected to the root. Each one of 
these nodes is the root of a tree of depth m — 1. In order to have M m > (7 + e)m, it suffices that 
all k (independent) trees have a minimum M m _i > (7 — and all the steps between the root 

and its children are such that Yi, . . . , Y^ > (e + ^)ra. Therefore, 

P {M m > (7 + e)m} > P {M m _! > (7 - eS/2)m} k • P {Y > (e + eS/2)m} k 

> c ■ e _/cQ; ( £ +¥) m + ( m ) 

> c • e _/cQ!£ ( 1 + (5 ) m 

for m large enough, by Theorem 3.1 and our assumption on the tail of Y. 

We now prove the upper bound. Let h = [10 \og k m\ . We start by proving an exponential tail 
bound for S u where u G A h . Notice first that 

poo fOO 
E [ e a(l-«)y] = / V{e a(lS)Y > t}dt = / t -l/(l-S)+o(l) dt K ^ 

Jo Jo 
Thus, by Markov's inequality, we obtain 

W{S U > (1 - S)em} < E[ e <*(l-6)Yyi e -a(l-6).(l-6)em < e -(l-5fasm^ 

for m large enough. We can now prove exponential tails for M m . Using a decomposition according 
to the values of some nodes at level h + 1, we obtain 

P{M m > ( 7 + £)m} < P{Vp G [k],3u G A h : S p . u > (1 - S)sm} (3) 

+ P{3p G [k],Vu G A h : S p . u < (1 - ^mandM m > ( 7 + 6:)m}. 
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For the first event in (3) to hold, there must be k paths u 1 , . . . , u k such that, for the k disjoint paths 
p • u k at level ft + 1, the values are rather large: 

P{Vp e [k],3u e A h : S p . u > (1 - 5)em} < P {\fp e [A], 3u e A h : 5 p . n > (1 - J)em} 

< fc • ft- fc(1 -*' 3aem . (4) 

On the other hand, 

¥{3p e [fc], Vu e ^ : # p . n < (1 - £)era and M m > (7 + e)m} 

= P{3p g [fc], Vu e A h : [5 P . U < (1 - 6)em and Vv e A™' 11 ' 1 : > (7 + e)m] } 

<P{M m _ /l _i > ( 7 + fe)m}^ 

— c 

for m large enough and some constant Co < 1 (using Theorem 3.1). As a consequence, for fixed 
(5, e > 0, we have for all m large enough 

P{M m > ( 7 + e)m} < k kh+1 e- k ^^ aem + cg h . 

By our choice for ft = [lOlog^ mj we have /c^ > m 10 /k. Finally, since S > was arbitrary, the 
desired upper bound follows. □ 

At this point, we should comment on tail bounds for minima in branching random walks. Al- 
though the estimates for the upper tail of M m in Theorem 3.2 are tight, we must mention that the 
strength of the result does not really compare to recent results on the precise location of minima in 
branching random walk. Indeed, Theorem 3.2 only provides decent estimates for P{M m > 7m + i] 
for positive t, while E[M m ] = jm—f3 log m+0(l) [1]; for the study of branching random walks the 
tail bounds of interest are actually those for P{M m > E[M m ] + 1} or P{M m > 7m - j3 log m + 1}. 
See also the related results about tightness and weak convergence for M m [2, 6, 8, 9, 27]. Equivalent 
results for the height of random trees (binary search trees and m-ary search trees) were proved in 
[10, 22, 33]. Similar comments apply for our estimates on the left tail that follow. 

The left tail for the minimum at level m in a branching random walk is essentially governed by 
the level m: To change the minimum M m , it suffices to change one single of the k m paths of length 
m and this way of proceeding is essentially optimal. 

Theorem 3.3 (Left tail). Suppose that there exists A < such that E[e Ay ] < 00. Then, for any 
S > 0, there exists constants c, d such that 

ck m e -m(A*( 7 -e)+5) < p {M m < (7 - e)m} < J k™ e'™^ ^~ S \ 

Proof. The upper bound follows easily from the union and Chernoff 's bound [11, 15]: 

P{M m < (j-s)m} < k 11 
< k 71 

The lower bound is proved using a branching process argument. Let L > 1 be an arbitrary integer 
to be chosen later. The potential individuals of our branching process are the nodes of U at levels 
iL, i > 0. A node u is called good if either it is the root (lies at level 0), or it lies at level (i + 1)L for 
some i > 0, its ancestor v at level iL is good and S u — S v < (7 — e)L. Let Z { denote the number 
of good nodes at level iL in the tree ; {Z iy i > 0} is a Galton-Watson process. Clearly, if there is a 
good node at level iL, i.e., Zi > then Mil < (7 — e)iL. As a consequence, 

P {M iL < (7 - e)iL} > P {Zi > 0} . (5) 

By the second moment method [3], more precisely the Chung-Erdos inequality [12], we have 

P{Z, > 0} > -y^ = (6) 

J -E[Z Z 2 ] Var(Z z )+E[Z,] 2 
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For the Galton-Watson process {Z iy i > 0}, one has 



2 W 



E[Zi] = // and Var(Z*) =a z - — — /x t_ \ 

1 — /i 

where /i := E [Zi] and a 2 := Var (Zi) [5]. We now move on to choosing L to obtain a good lower 
bound on the right-hand side in (6). The mean number of children of an individual is 

/i = E[Z x ] = Jfe L P j^l^ < (7-c)lJ =k L e- LA *^- £ ^°( L \ 

as L — » oo. Note that since e > 0, we have /i < 1 for any L > 1. So, for any 5 > there exist L 
large enough that 

M > fc L. e -L(A*(7-e)+*). (?) 

Fixing this value for L, and since /i < 1, (6) yields P{Z^ > 0} > C/i*, for some fixed constant 
C > independent of z. More precisely, together with (5), we obtain 

P{M iL < (j-e)iL} > ck iL e~ iL ^^- e)+5 \ 

which proves the lower bound for all m = iL, for some i > 0. Finally, when m = iL + r for 
r G {1, . . . , L — 1} it suffices that < (7 — s)zX and Y^j=i — (7 — s ) r ' wnere J > 1 are 
i.i.d. copies of Y". Since 




\Yi < >0, 

the result follows easily for general integers m. □ 

About the limiting constants for the dag model. Before proceeding to the analysis 
of the circuit model, we use the tail bounds we have just devised to provide rough arguments that 
explain the values of the constants in our laws of large numbers. (The symbol ~ is used in a very 
informal way.) The first main idea is that the dependence is small enough that the upper bound given 
by the union bound essentially yields the correct constant. By Theorem 3.3 we (should) have 

¥{D n > clogn} « P{M [clognJ < logn} 

^ ^clogn m e -clognA*(l/c) 
_ n c(logk-A*(l/x)) m 

In particular, we have lim n ^ 00 P {D n > c log n} = for every c such that A*(l/c) > log k. This 
suggests that we might have D n ~ A& log n in probability for A& = sup{x : A*(l/c) < log k}. The 
proof of this fact is the topic of Section 4. Similarly, using Theorem 3.2 we obtain 



min D x < c log n > < nP {D n / 2 < c log n} 

~P{M LclognJ >log(n/2)} 
~ ne~ kaclogn<yl / c ~ 7 ^ 



n l-ka(l-cj) 



so that it should be the case that 

A 1 



— » 1 — - — A/e in probability. 



log n \ ka / 

We prove this formally (when the limit constant above is non-negative; min^ D x is non-negative) in 

log n 



Section 5. In a similar way, we can see that Theorem 3.3 leads to a correct guess that max x < n ^f- — >> 



ke in probability when the attachment distribution is uniform [36]. 
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4. Longest paths in random /c-DAGs: typical distance D n 

We start by studying the length D n of the longest path from node n to the root of the tree. The 
typical distance D n is studied using methods similar to the study of the typical shortest path distance 
in [18]. Let A* denote the rate function associated with the random variable Y = — log X. Define 

X k =sup{z > l/E[-logX] : A*(l/z) < log/c}. (8) 

Note that we will consider attachment distributions with bounded density / and thus E [— log X] = 

— Jq 1 (log x)f(x)dx < oo. Moreover, as X < 1, E [— logX] > 0. 

Theorem 4.1. Let k > 1. Suppose X E (0, 1) has a bounded density. Then, the longest path from 
node n to the root in a /c-SARRD with attachment X satisfies, as n — >• oo, 



log n 

in probability. 

The theorem follows from Lemma 4.2 and Lemma 4.3 below. From Theorem 4.1, we easily 
obtain the asymptotics for the typical distance in the uniform circuit model discussed by Arya et al. 
[4] and Tsukiji and Xhafa [36]. Indeed, if X is uniform (0, 1), then the parents of a node i are i.i.d. 
uniform in {0, 1, . . . , i — 1}. In this case, — logX ~ Exponential 1) and A*(z) = z — 1 — log z. 
Then is the only solution of the equation in z 



( 



ke\ z 

= e 

z J 



that is at least 1. Numerical values are presented in Table 1. 



k 


2 


3 


4 


5 


10 


A*; 


4.311070407... 


7.080786915... 


9.820440021... 


12.55049054. . . 


26.16346184... 



Table 1. Some numerical values for the constant Xk for the case of uniform recursive circuits. 



4.1. Upper bound on D n . 

Lemma 4.2. For any c > there exists C and r] > such that for all n large enough, we have 

P{L> n > c log nl^Cn" 77 . 

Proof. To simplify the notation, we prove P {D n > c log n + 2} < Cn -r? for all c > which is 
an equivalent statement. Let R n denote the number of hops to the root for a random path from n, 
i.e., a path that choses a uniformly random edge at every step. By the union bound, we get 



¥{D n > clogn + 2} < /c clogn+2 -¥{R n > clogn + 2}. 



(9) 
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Let Xl, X2, • • • be i.i.d. copies of X. Then, to bound the right hand side, we let t = [clog n] and 
apply Markov's inequality 

¥{R n > clogn + 2} < ¥{R n > t) 

<P{nXi...X t > 1} 

< inf n x E[X x ..X f x ] 

~ A>0 L 1 t J 



inf n x E[X 

A>0 



Ait 



= |nf exp ^Alogn + A(— X)t^ 

< inf exp (^Alogn + A(-A)clogn) , (as A(-A) < 0) 

< exp ( — sup < — — — A(— A) 1 clogn 

V a>o I c J y 



= exp(-cA*(l/c)logn) . (10) 
Combining (9) and (10), we get 

¥{D n > clogn + 2} < k 2 exp (clogn (log k - A*(l/c))) 

= fe 2 n c(logfc-A*(l/c)) B 

As A* is a decreasing function on (— 00, E [— log X]) and c > A&, we have A*(l/c) > log k, which 
completes the proof. □ 

4.2. Lower bound on D n . The objective of this section is to prove the following lemma. 

Lemma 4.3. For any fixed £ > 0, we have 

F{D n <(l-e)\ k logn}^0, 

as n — >• 00. Furthermore, if there exists A > such that E[X _A ] < 00, then there exists constants 
C and r] > such that for all n large enough 

¥{D n < (1 -e)X k logn} < Cn~\ (11) 

We start by describing the proof strategy and each subsection covers part of the proof. Consider 
the set of ancestors of node n. These nodes form a directed acyclic graph where each node has 
out-degree k except for the root. We start by considering the following model for the ancestor DAG. 
Define a family of independent random variables X' s for all strings son^4 = {l,...,fc} that have 
the same distribution as X. For a string s of length i, write s , s l5 s 2 . . . , s £ for the prefixes of s of 
length 0, 1, . . . , i respectively. The ancestor of node n obtained by following the path s is labeled 

L'(n )S )=L...LL^J^J"-^J- 
This defines a tree (indexed by the strings s) where distinct nodes can share the same label. In this 
ideal model, the length of the longest path that reaches a node labeled can be obtained with little 
effort from Theorem 3.1. 

In our SARRD, the labels actually correspond to nodes, and there is a unique node labeled i, for 
i = 0, . . . , n. The tree of ancestors of a node is then actually a DAG, as the in-degree of a node can 
be more than one. In particular, this creates dependencies between the random variables X along 
different paths even if they have no common prefix. 

In order to avoid having to deal with these dependencies, we use the following strategy to find a 
long path. Note that in our ideal setting, a path to the root corresponds to a path to a node labeled 
0. Starting at node Vo = n, we look at all the ancestors of n of order t (i.e., £ jumps away from n) 
for some well-chosen large constant L From all these possible paths, we pick the path (of length £) 
that reaches the node V\ with the largest label. Then the same process is repeated for this node until 
a node with label is reached. This strategy defines a path, and we show that the length of this path 
can be made as large as (A& — e) log n with high probability. The advantage of using this method is 
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that only a small portion of the tree is visited so it is easier to bound the "collision" probability. The 
path constructed for the ideal model can then be shown to be exactly the same in a real SARRD with 
high probability. 

4.2.1. The ideal tree. In this section, the objective is to obtain a good lower bound on the length 
of the longest path from node n to a node of label in the ideal tree (Lemma 4.4). More precisely, 
let Vj denote the label obtained after j steps (each one composed of £ jumps); the string defining 
the path from n to the corresponding node is denoted Sj G A^ £ . We have V$ — n, So = and for 

j>0, 

Vj+i = max Z/(n, Sj ■ s) and = Sj • argmaxZ/(n, Sj • s). (12) 

seA* seA e 

The objective is now to show that starting from n and after q — |~(1 — e)Xk \og{n)/£~\ steps of 
£ jumps, no node of label zero has been reached with probability going to 1. If this happens, we 
clearly have a path of length at least q£ > (1 — e) \k log n between n and a node labeled zero. 

Lemma 4.4. Suppose E [— logX] < oo. Let e G (0, 1). For q = — e)Xk \og(n)/£], we have 

¥{V q < n £/4 } -> 0, 

as n — > oo. Furthermore, if there exists A > such that E[X _A ] < oo, then, there exists r] > such 
that for all n large enough, we have 

F{V q < n £/4 } < n' 71 . (13) 

Proof Recall that for a string s, we write s Q = 0, s t , s 2 , . . . for the prefixes of 5 of length 0,1,2,..., 
respectively. Thus, for j > 0, we have by definition 

V j+1 = m^{[[VjX f Sr J • -X' Sj . s J :seA 1 } 

> Vj • max{X^.^ i x • • • x X' Sj .^ : s e A 1 } - £. 

Letting = rnaxfl^.. x • • • x X^.. S£ : 8 G ^}, we have 

9-1 



i=o 



It follows immediately that 



¥{V q < n £/4 } < P <^ n TT Zf - qt < n s/4 




<P^-logZf > (l - |) log7 



^=0 

for all n large enough. Observe that — log Zq is nothing but the minimum at generation t of a branch- 
ing random walk with increments distributed as — log X . As E [— log X] < oo, using Theorem 3.1, 
we get 

_ i (14) 

We pick £ = £ (£) so that for £ > £ , we have (1 - f )A fe E[- log Z^} < £. In the rest of the proof, 

£ = £q is fixed and we let Z be a random variable having the same distribution as Zq . 

We can now use a limit theorem for renewal processes (see for example [24], Chapter 10.2, 
Theorem 1, see also [25]). We have the convergence in probability: 

maX { 9 : - log Zf < (1 - f ) logn} v 1 (l-f)A fc 



(1- |) logn E[-log4 £) ] 
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where the last inequality follows by our choice for L Thus, with probability going to one, and for 
q = [(1 — e)Xk log n/f\ we have 

lim ¥{V q < n £/4 } = 0. (16) 

n— )>oo 

We now move on to the proof of the tail bound (13). Note that, in a stochastic sense, we have 
Zq > X\ ■ X2 • • • Xt, where Xi are i.i.d. copies of X. Hence, since £ is fixed, there exists 77 > 
such that E[Z _r? ] < 00 so that, by Markov's inequality, 

E [ e -TjlogZ] 




r}£(l-e/3)/(l-e/2)\ k 

for all n large enough. To complete the proof, observe that, as 77 — » 0, we have 

E[e _r7logZ ] f^r , ^ (l-e/3)£ \ 
L J -1 + 77 E[-logZ]-^ ^- +0(77) 



e ^(l-e/3)/(l-e/2)A fe ' '/ ^ l (1 - e/2)A fc 

£ / 1 1 



for 77 small enough by our choice for l\ it is routine to verify that the second term is indeed negative 
for any e G (0, 1). Since q = [(1 — e) \og{n)/£\ , the tail bound (13) follows readily. □ 

We call the event studied in this section A n (q(n)) := {V q ( n \ > n £//4 }. To make the notation 
lighter, the dependence in n will be omitted. 

4.2.2. The real DAG: handling collisions. Recall that the statement in Lemma 4.4 only deals with 
the ideal tree model. In order to prove that a long enough path also exists in the real DAG, we couple 
the SARRD and the ideal tree in such a way that with high probability, the path S q that we exhibited 
in the ideal tree can also be found inside the SARRD. What we mean here is the following: the path 
S q defines a sequence of distinct labels in the ideal tree; in the SARRD the nodes corresponding to 
these labels are also along a path with high probability. 

We use the random variables X' s that were used in the ideal tree, and new independent random 
variables X" p for x G {0, . . . , n} and p G A. We start by defining an ordering of strings: 

s < s' if and only if |s| < \s'\ or |s| = \s'\ and s <i ex s' (17) 

where <i ex is the lexicographic order on the strings on the alphabet A. This is the breadth-first order 
in the fc-ary tree. Then, if one of the paths from n has label x in the ideal tree, let T x denote the first 
(or breadth-first) such path: 

T x = min{s : L'(n, s) = x, \s\ < n}, 

where the min is taken with respect to the order in (17) and we agree that min{0} = oo. The labels 
in the ideal tree are not distinct, and the corresponding vertices cannot directly represent nodes in 
the DAG. We now ensure that a node x in the SARRD corresponds to the^r^st vertex in the ideal tree 
with label x (if such a vertex exists): for all p G A let 



7^ oo 
= oo. 



Clearly, the random variables X x # are independent and distributed as X, so that they define a proper 
(X, /c)-SARRD. 

We define the event that the sequence of labels along the path S q is the same in the DAG generated 
by the variables (X x ^ p : < x < n, 1 < p < k) and in the ideal tree generated by {X' x : < x < 
n,l < p < k) 

B(q) = {L'(n, s) = L(n, s) : s ^ S q } . 
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In particular, if B(q) and A(q) both hold, then the path S q does not reach the root since its 
minimum label is at least n £ / 4 and since S q is also a path in the DAG we have D n > qt. Write E c 
to denote the complement of an event E. Then, since q— |~(1 — e) Xk log(n)/£] , we have 

P {D n < (1 - e)X k log n} < P { A(q) c U 

= F{A(q) c } +F{B(q) c D A(g)} . (18) 

Lemma 4.4 bounds the probability that does not hold so that to prove Lemma 4.3, it suffices to 
bound the probability that B(q) c D A(q) occurs. 

Lemma 4.5. Let e G (0, 1) and let q = q(n) = \(1 — e)Xk \og(n)/£]. Then, there exists rj > 
such that for n large enough 

¥{A(q)nB(q) c } < n' 71 . 

Proof. In the following, we write n± = n £ / 4 . Recall that s~ denotes the prefix of s of length |s| — 1. 
If the path S q is a breadth-first path in the ideal tree (in the sense that each of its vertices is the first 
with its label) then our coupling ensures that B(q) occurs. So for B(q) to fail, there must be a node 
in S q that is not the first with its label. We have 

¥{B(q) c H A(q)} <¥{3j <q,\s\<£: T u{v ^ s) £ s,T L , (Vija -) = s~ , V q > m} 

<^P{3 S : \s\ <^T L , (y . 5s) ^ S ,T L , (y . 5S -) =s~,V q >m}. (19) 
i=o 

For any fixed j < q, we have by the union bound 

< p { 3s o < s : L'(V j: s ) = L'(V5,s),T L , (Vjii -) = s",^ > m} 

M<<? 

^ E E p {% s «) = L^'O^OJ ,2L'(v ilS -) = V, > »i} ■ (20) 

|s|<£s <s 

We now condition on L'(Vj,s~) and on L'(Vj, so) and use the independence between the random 
variable X s and (£'(V,-, s"), L (Vj, s )): 

P{L'(V i)So ) = [X^'(y j)S -)J ) r i((Vjl ,-)=a- 1 7, > m} 

< ^ P{w= L^J,^'(^,s")=^i'('l / i,so)=t',i , (^,s-)>n 1 } 

u,i><n 

< sup p{^ = L^CJ}- 

n>ni ,v<n 

Since X£ has a density bounded by 6, we have, for any v < n and u <n\, 

¥{v = \uX'\} = W{v <uX' <v + l}< - < — . 

Thus, going back to equations (19) and (20), since there are at most qk 2 ^ +1 ^ terms, each one at most 
bj 774, we obtain 



>{B(q) c nA(q)}< 



Tlx 



which readily yields the result since ri\ = n £ / 4 , q = [(1 — e)Afc log(n)/^] and 6, fc,^ are fixed 
constants. □ 

Thus, recalling Lemma 4.4 and (18), we have 

lim P {D n < (1 - e) X k log n} = 0. 



12 



NICOLAS BROUTIN AND OMAR FAWZI 



In the case when the attachment distribution X is such that E[X x ] < oo for some A > 0, there 
exists r] > such that the following stronger bound holds for all n large enough 

¥{D n < (l-e)X k \ogn} <n~\ 

which finishes the proof of Lemma 4.3. 

5. Longest paths in random /c-DAGs: minimum distance min n / 2 <^< n Ac 

In this section, we suppose that the attachment distribution X has a bounded density and satisfies 

P{X <t} = t a+o{1 \ (21) 

as t -> 0, for some a e (0, oo). Note that this implies that E[X~ a / 2 } = ¥{X~ a ^ > t}dt < 
oo. The lower tails of the step size should clearly influence the distances; the present setting with an 
underlying branching structure points towards the dependence in (21). The value of a determines 
the value of the minimum distance for a random recursive (fc, X)-DAG. Define the constant 

P := max ^1 - -L,0^ . (22) 

Theorem 5.1. Let k > 2 and X be as above. The minimum longest path distance in a random 
recursive (fc, X)-DAG satisfies 

mm n/2<aKnAE n . 

7^ ^A fc , 

logn 

in probability, where X k = sup {z : A*(l/z) < log k} is the constant defined in (8). 

In the case where P {X < t} > t a+ °^ for any a > 0, we obtain a limit of 0. In fact, the 
upper bound in Lemma 5.2 only uses a lower bound on ¥ {X < t) for t in the neighborhood of 
0. Similarly, if P {X < t) < t a+ °^ for any a > 0, the limit becomes as the lower bound of 
Lemma 5.3 only uses an upper bound on P {X < t} for t in the neighborhood of 0. 

For X ~ uniform[0, 1), we have a = 1 and (3 = 1 — 1/k. This yields min n / 2 <a;<n D x = 
(k — l)/k • Xk log n) for uniform random recursive DAGs. In particular, for k = 2, we obtain 

D n mm n /2 <x<n D x \ 2 
" Xo and ~ = > 



logn logn 2 

where A2 = 4.31107 This completes the table of asymptotic properties of different natural 

distances in uniform random DAGs [18, Table 1]. 

The reader will easily be convinced when reading the proof that the result remains unchanged if 
one considers minlZ^ : Sn < i < n}, for some S > 0. We keep the current statement for simplicity. 

5.1. Upper bound on min n / 2 <x<n Ac- 
Lemma 5.2. For any e > 0, there exists n > such that, for all n large enough, 



min D x > (3 + e)\k logn > < n 71 . 

n/2<x<n J 

Proof. The strategy is to show that there exists a node x G { \n/2] , . . . , n} such that all its parents 
have label at most n^ +£ . We then conclude using Theorem 4.1. For n/2 < x < n, we look at the k 
parents L(x, 1), . . . , L(x, k) of x in the DAG. Since f3 > 1 — ^ and x < n, we have 

P {max{L(x, 1), . . . , L(x, k)} < n^ £ } = P {xX < n p+£ } k 

>P{X<n- 1 /^ +£ } /c 

> n -l+kas/2 

for all n large enough, as X satisfies (21). Define 

N = \{x : [n/2] < x < n, max L(x,i) < n^ +£ }\. 

1 L Ki<k J 1 
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We have 

E [TV] > (n - |"|] + l) • n' 1 ^' 2 > (23) 

Observe that the events {max(L(x, 1), . . . , L(x, 1)) < 77/ 1 - 1 /( fcQ! )+ £ } a re independent for different 
nodes x. So we can also compute the variance of TV: 

n 

Var(TV) < ^ P {max{L(x, 1), . . . , L(x, k)} <n^ £ } = E [TV] . 

x=\n/2\ 

Thus, using the second moment inequality, sometimes called Chung-Erdos inequality [12], and 
since E[TV] —> oo by (23), we obtain 

P{TV = 0} < Var (7V) . < = 0(n- fca£ / 2 ). (24) 

1 J " Var (TV) +E [TV] 2 " 1+E[TV] V ; 

Now, define the random node 

V = max {{0}u{x: \n/2] < x < n, max(L(x, 1), . . . , L(x, k)) < n^ +£ }} . 

It only remains to show that such a node V has a small longest path to the root. It is sufficient to 
bound the following probability, with e < 1. We have 



{ 



min D x >(@ + 3e)X k log n + 1 

n/2<cc<n 



<P{LV > (/3 + 3£)A fc logn + l,TV>0}+P{TV = 0} 

fc 

< P > (P + 3e) A fe log n, TV > 0} + P {TV = 0} 



s=l 

k 



<^>{^l(v, s) > (P + 3e)\ k \ogn,L(V,s) < n^} + P {TV = 0} 



8=1 



< k¥ {D [n p + e } > (1 + e)(P + e) A fc log n} + P {TV = 0} for e < 1. 

The last inequality holds because we can condition on L(V, s) and use the independence of the 
events {L(V, s) = v} and {D v > (1 + + e)Afc logn} for any v. We can then conclude using 
Lemma 4.2 and (24). □ 

5.2. Lower bound on min n / 2 <x<n Ac- The objective of this section is to prove the following 
lower bound. Note that when fj = 0, no lower bound is needed to prove Theorem 5.1, so that we 
can safely assume here that (j > 0. In particular, we have /? = 1 — 

Lemma 5.3. Suppose that f3 > 0. For <2?ry £ > 0, exzsfa 77 > swc/z 
pi min < ((3-e)\ k logn \ = 0(n _r7 ). 

|^n/2<x<n J 

Clearly, it is sufficient to consider e < (3; we do so until the end of the proof. By the union bound, 
we have 

p{ min D x < ((3 — s)\ k logn j < V V {D x < (J3 - s)\ k logn} 

I n/2<x<n * — ' 
k 1 J x=\n/2~\ 

< n • sup P {D x <(/3- e)\ k log n} . 

n/2<x<n 

So it suffices to show that ¥{D X < (f3—£)\ k \ogn} = o(l/n) uniformly for all x G {[^/2] , ...,n}. 
In the rest of the proof we fix x G { \n/2] , . . . , n}. 

If D x < (f3 — e)X k logn, this means that all the ancestors a of x of order h (think ofh = 
[10 log log nj ) have depth D a < (f3 — e)X k log n — h. To bound the probability of such an event, 
we show that there are many distinct ancestors of order h that have labels at least n^~ 5 (Proposition 
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5.8 that is based on Lemma 5.7 and 5.5). Then using the explicit bound in Lemma 4.3, we prove 
that the probability that none of these ancestors have direct parents that have typical depth is small. 

More precisely, we build a small DAG of ancestors of x up to h = [10 log log nj generations. We 
start by showing that many of the ancestors of order h have labels at least n^~ 6 . As a warm up, we 
first bound the probability that a single ancestor h levels away from x has a low label. 

Lemma 5.4. Let x > n/2 be fixed. Let s G [k] h where h = [10 log log n\ and u > h. For any 
r] > and n large enough, 

¥{L(x,s) < u} < (-J 
In particular, for 5 G (0, (3) and u = n^~ 5 , we get 

P{L(x, s) < n?- 6 } < n - 1 /fc-«5/2 > (25) 

Proof. We have the following bound 

F{L(x, s) < u} < P{nXi ...X h -h<u} 

<^{X 1 ...X h <-} 

= F{(X 1 ...X h y>(2u/ny}, 

for all n large enough and any 7 < 0. By Markov's inequality and the independence of X\ , . . . , Xh, 
it follows that 

¥{L(x,s) <n^- 5 } < (2u/n)~^ • E [X x 7 . . . X^} 
= (2u/n)-^ -E[X^} h . 

Now observe that, for any e > 0, we have 

/>OC 

E[X- a+£ ] = / F{X~ a + £ > t}dt < 00, 
Jo 

by our assumption on the tail of X in (21). Choosing 7 = — a + e for small enough e, we obtain 

F{L(x,s) < u} < (2u/n) a - ri/2 < (u/n)^ 
for n large enough. We obtain (25) by choosing 77 small enough. □ 

In order to handle the dependence between different paths of the DAG, we bound the number of 
path intersections far from the root. More precisely, order the strings s according to the order defined 
in (17) (breadth-first order). Define the set of nodes that are ancestors of x along paths indexed by 
the words s' < s\ V(x, s) = s'), s f < s}. Then we say that a path labeled by s collides if 

L(x, s) G V(x, s) and L(x,s t ) £ V(x,s t ), 1 < t < |s|, 

where is the prefix of s of length i. Of course, the chance to collide is greater if the labels are 
small; for us it will be sufficient to consider the nodes with label at least n^ -5 . Define the number 
of paths of length at most h colliding at nodes with label at least n^ -5 : 

N c = \{s : \s\ < h,L(x,s) > n^~ 6 and s collides} | . 
Lemma 5.5. Let S > 0. For all n > 1 and all i > 0, we have 

F{N C >i}< k^+^VrrW-V m 

Proof. Let i > 0, by definition, we have 

F{N C > i} =P{3si < - < Si : s u ...,Si collide} 

< k i{h+1) sup P {si , . . . , Si collide} . (26) 

We prove by induction that for alH > 

sup P{si, . . . ,Si collide} < {bk h+1 n 5 -P) 1 . (27) 

S\<---<Si 
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Clearly, (26) and (27) together imply the result. The base case, i = 0, is clear. Suppose now that 
i > 1. Let si < S2 < • • • < Si. Write Si = {si, . . . , Si} and = Si\ {s^. Note that Si-i is 
empty if i = 1. We say that a set 5 collides if all its elements collide. Recall that s denotes the last 
symbol of the word s. Then 

P {si, . . . , Si collide} 

= F{Si collides} 

= P{5i_i collides, L(x,Si) > n 1 ' 1 ^- 6 , L(x, s~) £ V(x, s~), L(x, s») G V(x,^)} 
= ^P{Si_i collides, L(a?,sr) = u,V(x,Si) = W,u<£ V(x,sr), [uX u ^\ G W) , 

u,W 

where the sum in the last line ranges on u > n 1 - 1 /^)- 6 and W C {0, . . . , n}. In the rest of the 
proof, W will always be a subset of {0, . . . , n} and we do not always remind it to keep the equations 
as light as possible. 

We claim that for any fixed W C {0, . . . , n} and any u, the event 

{Si-i collides, L(x,s^) = u,V(x, Si) = W,u £ V(x,s^)} and {L^«,sJ £ ^} 

are independent. The latter event is clearly determined by X UiP , 1 < p < k. Consider now the other 
event. In order to determine whether it occurs or not, it suffices to look at the ancestors L(x, s') of 
x for s' < Si. If the value of X^. is needed to compute one of these ancestors, then the event does 
not hold because we would then have u G V(x, s~ ). Otherwise, we can determine whether the event 
holds or not without looking at X u ^ . . 

For |s| < h, we have \V(x, s)\ < k h+1 and we can then write 

P {Si collides} < P {S^ collides} sup P { [uX\ G W} 

u>nP- s ,\W\<k h + 1 

< ¥{Si-! collides} • k h+1 bn-P +5 

< {k h+1 bn-^ 5 )\ 

by the induction hypothesis. This completes the proof. □ 

Recall that our aim is to prove that, with probability at least 1 — o(l/n), x has many distinct 
anscestors of order h, all of them should have a large enough label. To make this precise, define the 
event 

E x = {x has at least k h ~ 3 distinct ancestors of order h with labels at least n^~ 6 }. 

We want to prove that ¥ {E x } = 1 — o(l/n). We first show the following decomposition for E x . 
We have already proved that P{A/" C > 3} = o(l/n), and it will then suffice to bound the probability 
that the first event in (28) below does not occur. 

Lemma 5.6. For every x > n/2we have 

{3p G [k] , Ws p G [k] h : s p [l] =pandL(x,s p ) > n^ 6 } n {N c < 2} C E x . (28) 

Proof. The first event in (28) ensures that all the ancestor of L(x,p) of order h — 1 have labels 
greater than n^~ 6 for some p G [k]. These k 11-1 nodes need not be distinct. Consider the paths in 
the order defined by (17). By definition, if a path does not collide and has no prefix that collides, 
then its label is distinct from all the previous ones. It follows that only the paths that do have a prefix 
counted by 7V C might not have distinct labels. Here N c < 2 and it is simple to see that a pair of paths 
whose collision maximizes the number of potential duplicates are p • 2 and p • 3 if k > 3 and p • 2 
and p - 12 if k = 2. In any case, of the k h ~ x , there are at least k h ~ 3 nodes with distinct labels. □ 

To complete the proof that P {E x } — 1 — o(l/n), it only suffices to bound the probability of the 
first event in (28) not occurring. 

Lemma 5.7. Let S G (0, /3/4). For n large enough, 

F{3s\...,s k G Ik} 12 ' 1 : VpG [k],L(x,p • s p ) <n^~ 5 } < n' 1 - 5 ^. 
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Proof. Let A4 be the event that L(x, s J ) < rfi 6 for j < i. We prove by induction oni > 1, for n 
large enough 

F{Ai} < n~ i/k - 6/8 . (29) 

The base case, for i = 1, follows from Lemma 5.4. Suppose now that i > 2. The difficulty to 
prove the induction step relies in the dependence between the events Ai-\ and L(x, s l ) < n^~ 6 . 
We introduce the following notation for the paths from node x 

P(x,s) = {L(x,s 1 ),L(x,s 2 ),...,L(x,s)} and Pi-i = [J P(x,s J ). 

l<j<i 

Note that x is not included in the paths. To upper bound the left-hand side in (29), we condition on 
the first time that the path P(x,Si) reaches the set Pi-\\ 

P{4 l } = P{4Pfe^)nP i _ 1 = (J}+ P{Ai,i(^«i-i) i Pi-i,L(x,s_i) e P^}. 

l<t<h 

(30) 

In the following, W and Q will always denote a subset of {0, . . . , n}; we do not always remind it. 

i. The path P(x, s l ) does not collide. The first term in (30), on the event P(x, s z ) D P%-\ 7^ 0, is the 
easiest to deal with: 

¥{A U P(x, s') H P,_x = 0} = P {^' P ( x ' s ^ = ®> P *-i = W ^ 

W,Q:WC)Q=® 

As Q fl W = 0, the events 

{L(x, s*) < n?- 6 , P(x, s { ) = Q} and {L(x, s j ) < n^~ 6 for j < i, P;_i = W) 

are independent. In fact, the first event is in the sigma- algebra generated by {X ViP : v G Q, p G [k]} 
and the second in the one generated by {X ViP : v G W,p G [k]}. Thus, we obtain 

P{Ai,P(s,5*)riPi-i = 0} <P{A z _!}.P{L(x,^) <nP- s } 

<F{A i _ 1 }-n- 1/k - a5/2 , (31) 
where the last inequality follows from Lemma 5.4. 

ii. The path P(x,s l ) collides. We next look at the terms (30) that correspond to cases when there 
are some collisions, i.e., P(x, s z ) fl P%-\ ^ 0. Recall that t is the location of the first collision 
on P(x, s z ). In the following, we write a\ for the t-th symbol of s % (so s % = a\a\ . . . For 
t G {1, . . . , h}, we have 

P{^,L0r,5j_i) £ Pi_i,L(x,5j) G P,_i} 

We separate this sum into two terms depending on whether u < n^~ 6 or u > n^~ 6 . The sum 
over u < n^~ 5 can be bounded as in (31): since Si\ t is the first path that hits P^-i, we have 
P(x, n p i-i = Jt follows that 

|W r |<fe/iit<n^- 5 ,it^W 

<P{Ai_ 1 }-n- 1 / fe - a ' 5 / 2 , (32) 
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by Lemma 5.4. We now look at the sum over u> 5 . 

J2 £ P{^i,Pi-i = WiL^.aj.!) = u, [uX Ut< \ G 

|V^|</c^n>n/ 3 - 5 ,n^W 

^ E E E P{^i-i,i'i-i=W ) L(x,a*_J=« ) P( a :, i s*_ 1 ) = Q ) L«X Ui0 *j€W} 

n>n/3- 5 |VP|<fc/i,V^uQrW=0 
n>n /3-5 |W|</c/i,V^uQnW=0 

< V sup P{[^XJ G W} ) •¥{L(x,s i t _ 1 ) = u}-¥{A i _ 1 }. (33) 

n>n/3 - \\w\<k h / 

In the first equality, we used the independence of the events {^4i-i, P%-\ = W}, {L(x, sj_i) = 
P(x, = Q} and {X w>a * G if u £ W and Q n W = 0. Using the fact that X has a 
density bounded by 6, we have for any W of size at most kh, 

hhh 

¥{[uX\ eW}< -^. 

The next step is to bound the sum in equation (33) by considering groups of nodes in the intervals 
(2 £ [n^~ 5 \ , 2^ +1 \n^~ 5 \ ] for non-negative integers t. We have for any i integer and any r] G (0, a) 

J2 sup ¥{[uX\ G W} -P{L(x,^_i) = u} 



n=2^n/ 3 - (5 J+l 



\W\<kh 



<P{i(x,sj_ 1 ) <2 €+1 n' 3 - 5 } 6M 



2 /+i n fl-«\°'-'» bkh 



< 2 a bkh ■ (2 e n - 5 ) a -' n - 1 n- a+ri 

for n large enough. For the second inequality, we used Lemma 5.4. Now if a < 1, recalling that 

/3 = 1 — l/(ka), we can bound 

< n -a+»? n a-l/fe-a5-/3+<5 

for small enough 7/ and n large enough. For the case a > 1, we get 

n -a+77 . ( 2 V- 5 ) a -^- 1 < n- a+ri ■ (2 £ n f3 - 5 ) a - 1 < 2 a - 1 n~ 1 . 
provided 2 £ n^~ 6 < 2n. This shows that total weight of an interval can be bounded by 

sup V f sup P{ [uX\ G W} j • P{L(x, = ^} < n- 1//c - 5/3 

for n large enough, where the supremum is taken over non-negative integers £ such that 2 ir a^~ 6 < n. 
This allows us to bound the expression in (33) by P ^-V^-^/ 2 as there are at most log 2 n 

intervals. By summing this term with the term corresponding to the sum for u < n^~ 6 , we obtain 

P{A i ,L(x,5»_ 1 ) iPi- U L{xJ t ) GPi-i} ^{A^n- 1 ^-™*^ 5 (34) 

for any t G {1, . . . , h}. Now it only remains to bound the sum (30) over the different values of 
t using (31) and (34). Using the induction hypothesis to bound P{Ai_i} we obtain the desired 
induction step (29). □ 

Putting these results together, we obtain 
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Proposition 5.8. Let x G { [n/2\ , . . . , n}. Let S > 0. For n large enough, 

P^j^l-n-^ + n- 3 / 2 ^. 

We are now in position to prove the lower bound claimed in Lemma 5.3. Fix e, S G (0, 1/10) and 
ft = [10 log log nj . First, using the lower bound on the typical distance, we can say that most of the 
nodes in the tree have D y > (1 — e)X k log n. We want to show that with high probability, for every 
y such that n/2 < y < n we have D y > (/3 — 77) A & logn for arbitrarily small 77. Define the set of 
"bad nodes" that violate the property 

B={y:y>n/2,D y < (1 - e)(/3 - 2S)X k logn}, 

that we decompose in a dyadic fashion: for any positive integer r, define 

B r = {y : 2 r < y < 2 r+ \D y < (1 - e)(p - 2S)X k logn}. 

Using Lemma 4.3 and Markov's inequality, we have for every r such that (/3 — 25) log 2 n < r < 
log 2 n 

p{|S r | > ^j-^J <1006-2- r E[|/3 r |] <Cn^, 

for some C and 77 > independent of r. Recall that b is a bound on the density of X. As a result 
the event 

A = {\B r \ < 2 r /(1006) for (/3 - 25) log 2 n < r < log 2 n} (35) 

is such that P {A} = 1 — 0(n -r? / 2 ) for all n large enough. 
Thus, 

pi min D x < (1 — s)(f3 - 2S)Xk logn + ft + 1 i 

[n/2<x<n J 
n 

<P{A C }+ ^ P{A, J D aj <(l-e)( i 8-2(J)A fc logn + ft + l} 

n 

<F{A C }+ V{A,Vs£[k} h :D L{x , s) <(l-e)(f3-25)\ k \ogn+l}. (36) 

x=\n/2\ 

We now bound the term P{A,Vs e : D L(liS) < (l-e)(/3-25)A fc logn+1} by conditioning 
on the event E^. Let Si,.. ., S k h-3 denote, when E x holds, a set of paths that lead (when starting 
at x) to distinct nodes. 

P{AVs G ^ : £>L(x, s ) < (1 -£)(/? - 2,5) A fe logn + 1} 

< P {E c x } + P { A, E x , Vs e ^ : D L{XtS) <{\- e )(fi- 26) X k log n + 1} 

< P + P {A, E x , Vi G {1, . . . , fc' 1 " 3 } : DlCx.s,) < (1 - e) (jS - 2<5) A fe logn + l} 
<P{^} + P{A^,ViG{l,...,fc' l - 3 } ) pG^: Li(x,^)X i(x>Si)iP J £6} 

By taking the worst possible set B (compatible with the event A) we obtain the bound: 
V{A,E x ,Vie{l,...,k h ~ 3 },peA: [L(x,S z )X L{x ^ )jP \ G B} 

, x k-k h ~ 3 

< I sup¥{[uX\ G B}\ (37) 

\B,u>nP- s J 

where the maximization is taken over all sets B such that \B D [2 r , 2 r+1 )| < 2 r /(1006) for all r. 
Note that a key point here is that the nodes L(x, Si) for i G {1, . . . , k h ~ 3 } are distinct and all have 
labels at least n^~ 6 . It now suffices to bound the right-hand side of (37). 

Lemma 5.9. Suppose B C {0, . . . , n} such that for all r satisfying — 25) log 2 n < r < log 2 n, 

W£ ft<2V£ 

|£n[2 r ,2 r+1 )| < 2 r /(1006). 
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Then for any y > 6 , we have for n large enough 

^{\.yx\ eB}<\/2. 

Proof Let r max the largest integer such that 2 rmax < y and let r m i n be the smallest integer at least 
as large as (/? — 26) log 2 n. We then have 

''"max 

F{[yX\ eB} <¥{[yX\ <2 r ™"}+ ^ P { [yX\ e B n [2 r , 2 r+1 )} 

r=r m i n 

< P < 2n^ 25 + 1} + - • -J- (2 r — +1 + 2 r — + ■ • • + 2 r — ) 

L J y 1006 v 7 

9^max + 2 

< 36n~ 5 + 

100?/ 

< 1/2 

for n large enough. □ 
Getting back to equation (36), we get 

pi min D x < (1 - e) (/3 - 2£) A fc logra + ft + 1 

^n/2<a?<n 

n 

<p{^ c }+ 5^ (p{^} + (i/2) feh_2 ) 

= 0(n-^) 

for small enough 77 using Proposition 5.8. This concludes the proof of Lemma 5.3 and Theorem 5.1. 

6. Concluding remarks 

We studied the longest path distance in a general class of random recursive DAGs. The parents 
of node x are chosen independently and distributed as \xX\ for some random variable X G [0, 1). 
When X has a bounded density, we proved laws of large numbers for the typical and minimum 
distance. For both of these results, the upper bounds do not make any assumption on the attachment 
distribution X. We use the condition of bounded density for the lower bound when bounding the 
dependencies between the different paths up to the root. It would be interesting to extend these re- 
sults to more general distributions. More generally, under which conditions is it possible to translate 
a result in an ideal model like the branching random walk considered in Section 3 into a result about 
a real model that exhibits limited dependencies? 
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